Shifting Attention Using a Temporal Difference Prediction Error and High-Dimensional Input
نویسنده
چکیده
Research on reinforcement learning has increasingly focused on the role of neuromodulatory systems implicated in associative learning. Formulations of temporal difference (TD) learning have gained a great deal of attention due to the similarity of the TD prediction error and the observed activity of dopamine neurons in the primate midbrain. Recent work has attempted to integrate additional neuro-modulatory systems such as noradrenaline and acetylcholine in a TD framework. Additional work has been done to remedy representational issues arising from TD variants that result in incorrect predictions of dopamine activity, as well as to incorporate the TD error signal in models of categorization. In this paper, an actor–critic model incorporating aspects of TD learning and psychological models of attention is described. The development of the model and the behavior of an autonomous agent in a simulated environment are examined and compared with a variant of TD learning lacking an attentional component. The agent learns to behave adaptively due to the shifting of attention to relevant aspects of a high-dimensional input. In contrast, the TD model exhibits perseverative behavior and comparatively slow learning in the same context. It is suggested that real-time models of attention may provide insight into neuromodulatory systems implicated in attention and representational learning.
منابع مشابه
Finite Element Simulation and ANFIS Prediction of Dimensional Error Effect on distribution of BPP/GDL Contact Pressure in PEM Fuel Cell
Distribution of contact pressure between the bipolar plate and gas diffusion layer considerably affect the performance of proton exchange membrane fuel cell. In this regard, an adaptive neuro-fuzzy inference system (ANFIS) is developed to predict the contact pressure distribution on the gas diffusion layer due to dimensional errors of the bipolar plate ribs in a proton exchange membrane fuel ce...
متن کاملMammalian Eye Gene Expression Using Support Vector Regression to Evaluate a Strategy for Detecting Human Eye Disease
Background and purpose: Machine learning is a class of modern and strong tools that can solve many important problems that nowadays humans may be faced with. Support vector regression (SVR) is a way to build a regression model which is an incredible member of the machine learning family. SVR has been proven to be an effective tool in real-value function estimation. As a supervised-learning appr...
متن کاملGlobal Solar Radiation Prediction for Makurdi, Nigeria Using Feed Forward Backward Propagation Neural Network
The optimum design of solar energy systems strongly depends on the accuracy of solar radiation data. However, the availability of accurate solar radiation data is undermined by the high cost of measuring equipment or non-functional ones. This study developed a feed-forward backpropagation artificial neural network model for prediction of global solar radiation in Makurdi, Nigeria (7.7322 N lo...
متن کاملMultivariate Feature Extraction for Prediction of Future Gene Expression Profile
Introduction: The features of a cell can be extracted from its gene expression profile. If the gene expression profiles of future descendant cells are predicted, the features of the future cells are also predicted. The objective of this study was to design an artificial neural network to predict gene expression profiles of descendant cells that will be generated by division/differentiation of h...
متن کاملMultivariate Feature Extraction for Prediction of Future Gene Expression Profile
Introduction: The features of a cell can be extracted from its gene expression profile. If the gene expression profiles of future descendant cells are predicted, the features of the future cells are also predicted. The objective of this study was to design an artificial neural network to predict gene expression profiles of descendant cells that will be generated by division/differentiation of h...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Adaptive Behaviour
دوره 15 شماره
صفحات -
تاریخ انتشار 2007